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Background . Since the human eye employs three primary colors, and the 
Thematic Mapper returns seven bands of data, one obvious problem that arises 
In making color composite Images Is the choice of bands. The choice In 
non-trlvlal, since three bands can be selected from seven In 35 ways. Also, 
any band can be assigned any color. This gives a total of 210 different 
possible color presentations of TM three-band Images. In this note we present 
a way of reducing that 210 to a single choice, decided uniquely by the 
statistics of a scene or subscene, and taking full account of any correlations 
that exist between different bands. 

We should remark here that one well-known and widely used approach to 
this problem of choice Is through the use of principal component Images. 
However, such methods offer a new problem as great as the one that they solve. 
For although the first three principal components contain In a statistical 
sense as much Information as can be presented using three colors, the 
resulting scene Is completely data dependent. It Is thus difficult for an 
Interpreter to apply any previous experience of color-surface relationships to 
the analysis of ?, principal components Image. 


Definition of the method . Consider the 7x7 variance-covariance matrix M for 
the scene or subscene. Ignoring for the moment the fact that the thermal band 
Is of Inherently lower resolution than the rest. Any triplet of bands will be 

represented within this 7 x 7 matrix by a 3 x 3 submatrix. 
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Considering now the 3-d l mens l on$ I subspace spanned by any particular band 
triplet, the associated varltnce-covar lance matrix defines an ellipsoid within 
the subspace. Further, the sum of the squared principal axes of this 
ellipsoid represents the total variance accounted for by these three bands 
(see Figure 1). One could plausibly (but as we shall see, wrongly) argue that 
the best three bands are those with the largest sum of squared principal axes, 
and hence accounting for the largest total variance. This Is, after all, 
exactly the argument applied In employing principal component Images. Since 
the trace of a matrix Is Invariant under rotational transformations, and since 
the sum of squared principal axes Is equal to that trace, the band triplet 
that accounts for the most possible variance can be found from the original 
variance-covariance matrix simply by selecting the three bands with the 
largest diagonal elements. There Is no need to examine all 35 band 
combinations. 

To see what Is wrong with this approach, consider an extreme case where 
there happens to be perfect correlation between a pair of bands. For 
convenience, suppose that those bands are 1 and 2, and suppose that the 
variance of band 1 (and therefore of 2) Is larger than that of any other band. 
The 7x7 matrix M then has the form: 



where a>b,c, . . . 
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The rotation matrix that wit I diagonalize the upper left 2x2 submatrix 
then has the form: 



and thus after rotation the upper left 2x2 submatrix will have the form: 



As expected, one eigenvalue is zero; but the other Is the sum of the variances 
from the original bands 1 and 2. Since a is assumed to be large, both bands 1 
and 2 will be Included In the triplet that accounts for maximum variance — 
despite the fact that if either one of them is used, adding the other 
contributes no new Information. 

The problem lies In the use of total variance as the measure for the 
Information content of the band triplets. This is equivalent to use of the 
sum of squares of ellipsoid principal axes, and there Is no penalty associated 
with a very small principal axis provided that It occurs In association with a 
large axis (see Figures 2 and 3), as was the case for the above example. 

We propose the use of a new measure for the Information content of the 
triplet, and one that avoids the undesirable property demonstrated above. We 
will select the ellipsoid of maximum volume . This discourages selection of 
pairs of bands with high correlation, since In such cases one eigenvalue will 
be close to zero and the corresponding ellipsoid volume will be small. 

Since the ellipsoid volume Is simply 4/3.TTabc, where a, b, and c are the 
principal axes of the ellipsoid, the volume of the ellipsoid associated with a 
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particular band triplet Is a constant multiple of the square root of the 
product of the eigenvalues for the 3x3 var lance-covartance matrix of that 
triplet. However, und£)r rotational transformation the product of the 
eigenvalues Is equal to the determinant of the original 3 x 3 ‘'ubmatrlx. Thus 
we can select the band triplet that provides the ellipsoid of maximum volume 
simply by computing and ranking In order the determinants of each 3x3 
principal submatrix of the original matrix M. The band triplets associated 
with these determinants will then be ranked In order of decreasing overall 
Information content. Given the original matrix M, the total computation to 
achieve this ranking Is trivial. It requires a few hundred multiplications, 
followed by a sort of a list of 35 Items. A BASIC program to perform this Is 
given as an Appendix to this note. 

This procedure gives the best triplet, but the asslgment of colors Is 
still to be made. Now we can make use of the actual variances (the diagonal 
elements of M). Since the eye Is most sensitive to green, next to red, and 
least to blue, we will assign green to the band triplet member of maximum 
variance (l.e. most variation within the Image), red to the triplet member of 
second largest variance, and blue to the triplet member of smallest variance. 
The definition of bands for production of a color Image Is now complete. 

Examples and comments . The procedure has been appl led to a number of scenes 
of very different ground cover. Including Washington D.C., Death Valley, and 
Cement, Oklahoma. The results for Washington and for Death Valley are given In 
Tables 1 and 2, together with the associated variance-covariance matrices. The 
following comments apply to all scenes studied to date. 

1) The band combination 1,4,5 (In the order blue, red, green) Is usually, 
but not always, £he selected triplet. In cases where It does not rank first. 
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It ranks second or third, 

2) The natural color combination 1,2,3 and the standard falso color 
combination 2,3,4, both ptaco far down In the rankings. In the case of 
Washington, the natural color combination Is 29th (lower than anything except 
seme thermal band combinations, which are low for another reason to bo 
discussed shortly); the 2,3,4 combination was ranked tn 16th place. For Death 
Valley, the 1,2,3 natural color combination ranked 32nd, and the 2,3,4 
combination Just above It, at 31st, This Is presumably a consequence of the 
very high correlations between the first four bands. 

3) Triplets that rank high always tncludo either bond 5 or band 6 (note: 
the bands here are ordered by Increasing wavelength, so the thermal band Is 
band 7), This emphasizes the great Importance of these new bands on general 
Information-bearing grounds. 

4) The triplet selected Is not always or even usually tho triplet with 
the greatest Individual variances, though largo variances are naturally 
preferred somewhat In the selection process. 

Other .conaJ-decfllLans i-M firninrnls. 

1) The statistical analysis performed here used P topes (all that we had 
available) In which tho original histograms had already been modified by the 
gains and offsets. It would be preferable to work with data that have had no 
gains or offsets applied, J.e., with A tapes prior to any radiometric 
correction. If band selection of this type becomes common. It would be nice to 
have A tapes generally available from tho EROS Data Center. 

2) The thermal band Is of lower resolution than the rest, thus It would 
not be appropriate to give It the same weight In the selection process. How 
should one therefore de-weight It? One argument runs as follows: Tho maximum 
Information that a scene can contain Is given by the number of pixels, since 
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In the ultimate case there would be no correlation between pixels, and each 
would carry Independent Information about some feature of the surface. In such 
a case, the amount of Information that the thermal band can contribute Is only 
1 /16th that of the other bands, because there are 16 times fewer pixels In 
that band. Therefore one should deweight the thermal channel by a factor of 
16. Such deweighting was performed In the experiments reported here. However, 
we should also note that this made no difference at all to the preferred band 
triplets, since even without deweighting we found no case where a triplet 
Involving the thermal channel was In the top five. 

3) It Is obvious when one looks at Images created from the triplet 1,4,5 
that for some applications this combination will be much Inferior to others, 
such as natural color and standard false color. This restates the old truth, 
one man’s noise Is another man’s signal. However, the preferred triplets have 
another advantage: they provide Images of unusual clarity, with far less 
residual striping than Is seen In, for example, the natural color Images. 

4) Although combinations such as 1,4,5 produce Images that are at first 
sight unfamiliar and unusual, the assigned colors are not scene-dependent. 

Thus In contrast to the scene dependent colors of principal component or ratio 
Images, the Interpreter quickly learns to associate colors with particular 
ground condition. We therefore believe that there are definite advantages to 
seeking color composites from the original bands, rather than through band 
ratios or band combi rations. 
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Figure Is 


The variance- covariance ellipsoid, principal axes 
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Figure 2. High correlation. 

bands 1 and 2 . 


Figure 3. Low correlation, bands 
1 and 2, but lower 
individual variances. 



Tv, + large, 
'A, . “Aj. small 



A, + Ajr, smaller than in Fig. 2 
*A ( • A2_l ar 9 er than in Fig. 2 


Based on ellipsoid volumes. Fig. 2 case is preferred over Fig. 1 case^ 
although the former accounts for a greater total variance. 
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APPENDIX; THE BEST-BAND PROGRA M. ORIGINAL PAGE IS 

OF POOR QUALITY 


20 PRINT "SELECTION OF BEST THREE BANDS BASED ON ELLIPSOID VOLUME" 

30 PRINT "DEATH VALLEY WITH REDUCED THERMAL VARIANCE" 

MO DIM R(36),Q(36) 

50 DIM U(36),V(36) 

60 DIM M(8,8) 

70 REMARK: M is the variance-covariance matrix for the scene or subscene. 

80 REMARK: The arrays R,Q,U and V are storage arrays used in the program. 

90 REMARK : Note that the program assumes that band 7 is the thermal data , 
and band 6 is the 2 . 2 micrometre data . 

100 REMARK: The instructions 190 to 230 (except for 220, which sets a count) 
reduce the variance of the thermal channel to allow for the lower spatial 
resolution of the thermal channel pixel. 

190 FOR I = 1 TO 6 
200 MCI, 7) = MCI, 7) / 4 
210 NEXT 
220 C = 1 

230 M(7,7) = M(7, 7) / 16 

240 PRINT "RANK DETERMINANT COMBINATION" 

250 FOR I = 1 TO 5 
260 FOR J = I + 1 TO 6 
270 FOR K = J + 1 TO 7 

280 D1 = MCI, I) * (M( J, J) * M(K,K) - M(J,K) ~ 2) 

290 D2 = M(I, J) * (M(J,K) * M(I,K) - M(I,J) * M(K,K) ) 

300 D3 = M(I,K) * (MCI, J) » M(J,K) - M(I,K) * M(J,J)) 

310 DT = D1 + D2 + D3 

315 REMARK: The next instruction makes the determinant an integer; this is 
not necessary, it is done for convenience of output only. 

320 DT = INT (DT) 

330 N = 100 * I + 10 * J + K 

340 R(C) = DT:Q(C) = N 

350 C = C + 1 

360 NEXT 

370 NEXT 

380 NEXT 

385 REMARK: The next piece of code sorts the determinant into descending order. 

390 FOR I = 1 TO 35 
400 N = 0 

410 FOR J = 1 TO 35 
420 IF R(I) I R( J) THEN 440 
430 N = N + 1 
440 NEXT 

450 U(N) = R(I) : V(N) = Q(I) 

460 NEXT 

470 FOR I = 1 TO 35 
480 PRINT I,U(I),V(I) 

490 NEXT 
500 PR# 0 
510 END 
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SELECTION OF BEST THREE BANDS BASED ON ELLIPSOID VOLUME 

THIS IS THE WASHINGTON SCENE WITH REDUCED VARIANCE ON THE THERMAL CHANNEL 


RANK 

DETERMINANT 

COMBINATION 

1 

433858 

145 

2 

205811 

345 

3 

138551 

146 

4 

124784 

245 

5 

101638 

456 

6 

71723 

156 

7 

62960 

346 

8 

49759 

135 

9 

39992 

134 

10 

39609 

246 

11 

36060 

356 

12 

22847 

125 

13 

21953 

256 

14 

16732 

124 

15 

11646 

235 

16 

9709 

234 

17 

7967 

136 

18 

5094 

457 

19 

4752 

157 

20 

3634 

126 

21 

3606 

147 

22 

2294 

467 

23 

2194 

357 

24 

1945 

347 

25 

1616 

236 

26 

1386 

567 

27 

1348 

257 

28 

1130 

247 

29 

727 

123 

30 

688 

167 

31 

276 

367 

32 

215 

137 

33 

175 

267 

34 

84 

127 

35 

43 

237 
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TABLE l.b Ranked results for Washington D.C. scene . 


SELECTION OF BEST THREE BANDS BASED ON ELLIPSOID VOLUME 
DEATH VALLEY WITH REDUCED THERMAL VARIANCE 
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SELECTION OF BEST THREE BANDS BASED ON ELLIPSOID VOLUME 
DEATH VALLEY WITH REDUCED THERMAL VARIANCE 


RANK 

DETERMINANT 

COMBINE CION 

1 

1462581 

145 

2 

859695 

156 

3 

684248 

135 

4 

601687 

146 

5 

432952 

345 

6 

346425 

157 

7 

328331 

356 

8 

319827 

245 

9 

275534 

456 

10 

263989 

136 

11 

219239 

256 

12 

204146 

125 

13 

167450 

346 

14 

137060 

357 

15 

127643 

246 

16 

121117 

167 

17 

107494 

457 

18 

103781 

235 

19 

89506 

257 

20 

76827 

126 

21 

75913 

134 

22 

49163 

367 

23 

40621 

467 

24 

39230 

236 

25 

37614 

147 

26 

31621 

267 

27 

21579 

137 

28 

21322 

124 

29 

20256 

567 

30 

9168 

347 

31 

8118 

234 

32 

7895 

123 

33 

7197 

247 

34 

5037 

127 

35 

2407 

237 
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Table 2.b Ranked results for Death Valley scene 


